Displaying Representations of Environments

ABSTRACT

A method includes displaying a home ER environment characterized by home ER world coordinates, including one or more diorama-view representations of one or more respective ER environments. Each diorama-view representation includes ER objects arranged in a spatial relationship according to corresponding ER world coordinates. In some implementations, in response to detecting an input directed to a first diorama-view representation, the method includes transforming the home ER environment. Transforming the home ER environment includes transforming the spatial relationship between a subset of the ER objects as a function of the home ER world coordinates and corresponding ER world coordinates. In some implementations, in response to detecting an input associated with a first one of a plurality of diorama-view representations, the method includes changing display of the first one of the plurality of diorama-view representations from a first viewing vector to a second viewing vector while maintaining an arrangement of ER objects.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is entitled to the benefit of International App. No. PCT/US2020/047186, filed on Aug. 20, 2020, which is entitled to the benefit of the filing date of U.S. Provisional Patent App. No. 62/905,692, filed on Sep. 25, 2019, both of which are hereby incorporated by reference in their entirety.

TECHNICAL FIELD

The present disclosure relates to displaying content, and, in particular, displaying respective representations of environments.

BACKGROUND

A previously available device may display content within an operating environment. In augmented reality (AR) applications, the operating environment corresponds to a physical (e.g., real-world) environment including physical objects, and the previously available device displays virtual objects overlaid on a graphical representation of the physical environment. In virtual reality (VR) applications, the operating environment corresponds to a virtual environment including purely virtual objects.

However, the device lacks a mechanism for enabling three-dimensional (3D) manipulation of the displayed content within the operating environment. The device also lacks a mechanism for displaying virtual objects such that the virtual objects and physical objects do not occlude each other within the operating environment.

SUMMARY

In accordance with some implementations, a method is performed at an electronic device with one or more processors, a non-transitory memory, one or more input devices, and a display device. The method includes displaying, via the display device, a home enhanced reality (ER) environment characterized by home ER world coordinates, including a first diorama-view representation of a first ER environment. The first diorama-view representation includes one or more ER objects arranged in a spatial relationship according to first ER world coordinates. The method includes detecting, via the one or more input devices, a first input that is directed to the first diorama-view representation. The method includes, in response to detecting the first input, transforming the home ER environment by: ceasing to display the first diorama-view representation within the home ER environment, transforming the spatial relationship between a subset of the one or more ER objects as a function of the home ER world coordinates and the first ER world coordinates, and displaying, via the display device, the subset of the one or more ER objects within the home ER environment based on the transformation.

In accordance with some implementations, a method is performed at an electronic device with one or more processors, a non-transitory memory, one or more input devices, and a display device. The method includes displaying, via the display device, a plurality of diorama-view representations from a corresponding plurality of viewing vectors. The plurality of diorama-view representations corresponds to a plurality of enhanced reality (ER) environments. Each of the plurality of diorama-view representations is associated with a respective set of ER world coordinates that characterizes a respective ER environment. The plurality of diorama-view representations includes a first one of the plurality of diorama-view representations displayed from a first viewing vector. The first one of the plurality of diorama-view representations includes a first one or more ER objects arranged according to a first set of ER world coordinates. The method includes detecting, via the one or more input devices, an input associated with the first one of the plurality of diorama-view representations. The method includes, in response to detecting the input, changing display of the first one of the plurality of diorama-view representations from the first viewing vector to a second viewing vector while maintaining the first one or more ER objects arranged according to the first set of ER world coordinates.

In accordance with some implementations, an electronic device includes one or more processors, a non-transitory memory, one or more input devices, a display device, and one or more programs. The one or more programs are stored in the non-transitory memory and configured to be executed by the one or more processors, and the one or more programs include instructions for performing or causing performance of the operations of any of the methods described herein. In accordance with some implementations, a non-transitory computer readable storage medium has stored therein instructions which, when executed by one or more processors of an electronic device, cause the device to perform or cause performance of the operations of any of the methods described herein. In accordance with some implementations, an electronic device includes means for performing or causing performance of the operations of any of the methods described herein. In accordance with some implementations, an information processing apparatus, for use in an electronic device, includes means for performing or causing performance of the operations of any of the methods described herein.

BRIEF DESCRIPTION OF THE DRAWINGS

For a better understanding of the various described implementations, reference should be made to the Description, below, in conjunction with the following drawings in which like reference numerals refer to corresponding parts throughout the figures.

FIG. 1 is a block diagram of an example of a portable multifunction device in accordance with some implementations.

FIGS. 2A-2V are examples of an operating environment in accordance with some implementations.

FIG. 3 is a flow diagram of a method of changing a perspective view of a diorama-view representation of an ER environment based on a user input in accordance with some implementations.

FIG. 4 is a flow diagram of a method of transforming a spatial relationship between a subset of ER objects as a function of respective coordinates associated with a corresponding diorama-view representation and home ER world coordinates in accordance with some implementations.

SUMMARY

In some circumstances, a device may display one or more objects within an operating environment. In some applications, the device modifies display of a current operating environment. Modifying the current operating environment may include displaying different (e.g., new) virtual objects at respective predetermined locations within the current operating environment, independent of user input. Displaying a virtual object at a predetermined location within the current operating environment is problematic because the virtual object and a physical (e.g., real-world) object that is also within the current operating environment may obstruct (e.g., occlude) each other, degrading the functionality provided by the current operating environment.

By contrast, various implementations include systems, methods, and electronic devices that, in response to detecting a first input directed to a first diorama-view representation of a first ER environment, transform a subset of ER objects included in the first diorama-view representation into the home ER environment as a function of home ER world coordinates and first ER world coordinates. In some implementations, the method includes moving the subset of ER objects in response to detecting a second input. For example, in some implementations, the method includes moving the subset of ER objects relative to physical objects within the home ER environment. Accordingly, occlusions (e.g., obstructions) between physical objects and ER objects are negated, enabling a richer set of functionality with respect to the home ER environment.

In some circumstances, a device may display content within an operating environment. For example, in conventional video conference applications, the operating environment corresponds to a shared communication environment. The device displays remotely-located individuals associated with the shared communication environment based on respective recorded video streams of the individuals. Accordingly, the individuals may graphically interact with each other. The device displays the content within the operating environment in two dimensions (2D), such as a flat 2D video stream associated with a video conference. However, the device lacks a mechanism for enabling three-dimensional (3D) manipulation of the displayed content within the operating environment.

By contrast, various implementations include methods, systems, and electronic devices that enable changing a perspective view of a first one of a plurality of displayed diorama-view representations based on a user input. The plurality of diorama-view representations corresponds to a plurality of enhanced reality (ER) environments. Each of the plurality of diorama-view representations is associated with a respective set of ER world coordinates that characterizes a respective ER environment. Based on the user input, an electronic device changes the perspective view of the first one of the plurality of diorama-view representations while maintaining the previous arrangement of ER objects therein according to a respective set of ER world coordinates. Accordingly, the first one of the plurality of diorama-view representations may be manipulated from a 3D perspective, enabling a richer set of functionality with respect to the ER environments.

DESCRIPTION

Reference will now be made in detail to implementations, examples of which are illustrated in the accompanying drawings. In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the various described implementations. However, it will be apparent to one of ordinary skill in the art that the various described implementations may be practiced without these specific details. In other instances, well-known methods, procedures, components, circuits, and networks have not been described in detail so as not to unnecessarily obscure aspects of the implementations.

It will also be understood that, although the terms first, second, etc. are, in some instances, used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another. For example, a first contact could be termed a second contact, and, similarly, a second contact could be termed a first contact, without departing from the scope of the various described implementations. The first contact and the second contact are both contacts, but they are not the same contact, unless the context clearly indicates otherwise.

The terminology used in the description of the various described implementations herein is for the purpose of describing particular implementations only and is not intended to be limiting. As used in the description of the various described implementations and the appended claims, the singular forms “a”, “an”, and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will also be understood that the term “and/or” as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items. It will be further understood that the terms “includes”, “including”, “comprises”, and/or “comprising”, when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.

As used herein, the term “if” is, optionally, construed to mean “when” or “upon” or “in response to determining” or “in response to detecting”, depending on the context. Similarly, the phrase “if it is determined” or “if [a stated condition or event] is detected” is, optionally, construed to mean “upon determining” or “in response to determining” or “upon detecting [the stated condition or event]” or “in response to detecting [the stated condition or event]”, depending on the context.

Various examples of electronic systems and techniques for using such systems in relation to various enhanced reality technologies are described.

A physical setting refers to a world with which various persons can sense and/or interact without use of electronic systems. Physical settings, such as a physical park, include physical elements, such as, for example, physical wildlife, physical trees, and physical plants. Persons can directly sense and/or otherwise interact with the physical setting, for example, using one or more senses including sight, smell, touch, taste, and hearing.

An enhanced reality (ER) setting or a computer-generated reality (CGR) setting, in contrast to a physical setting, refers to an entirely (or partly) computer-produced setting that various persons, using an electronic system, can sense and/or otherwise interact with. In ER/CGR, a person's movements are in part monitored, and, responsive thereto, at least one attribute corresponding to at least one virtual object in the ER/CGR setting is changed in a manner that is consistent with one or more physical laws. For example, in response to an ER/CGR system detecting a person looking upward, the ER/CGR system may adjust various audio and graphics presented to the person in a manner consistent with how such sounds and appearances would change in a physical setting. Adjustments to attribute(s) of virtual object(s) in an ER/CGR setting also may be made, for example, in response to representations of movement (e.g., voice commands).

A person may sense and/or interact with an ER/CGR object using one or more senses, such as sight, smell, taste, touch, and sound. For example, a person may sense and/or interact with objects that create a multi-dimensional or spatial acoustic setting. Multi-dimensional or spatial acoustic settings provide a person with a perception of discrete acoustic sources in multi-dimensional space. Such objects may also enable acoustic transparency, which may selectively incorporate audio from a physical setting, either with or without computer-produced audio. In some ER/CGR settings, a person may sense and/or interact with only acoustic objects.

Virtual reality (VR) is one example of ER/CGR. A VR setting refers to an enhanced setting that is configured to only include computer-produced sensory inputs for one or more senses. A VR setting includes a plurality of virtual objects that a person may sense and/or interact with. A person may sense and/or interact with virtual objects in the VR setting through a simulation of at least some of the person's actions within the computer-produced setting, and/or through a simulation of the person or her presence within the computer-produced setting.

Mixed reality (MR) is another example of ER/CGR. An MR setting refers to an enhanced setting that is configured to integrate computer-produced sensory inputs (e.g., virtual objects) with sensory inputs from the physical setting, or a representation of sensory inputs from the physical setting. On a reality spectrum, an MR setting is between, but does not include, a completely physical setting at one end and a VR setting at the other end.

In some MR settings, computer-produced sensory inputs may be adjusted based on changes to sensory inputs from the physical setting. Moreover, some electronic systems for presenting MR settings may detect location and/or orientation with respect to the physical setting to enable interaction between real objects (i.e., physical elements from the physical setting or representations thereof) and virtual objects. For example, a system may detect movements and adjust computer-produced sensory inputs accordingly, so that, for example, a virtual tree appears fixed with respect to a physical structure.

Augmented reality (AR) is an example of MR. An AR setting refers to an enhanced setting where one or more virtual objects are superimposed over a physical setting (or representation thereof). As an example, an electronic system may include an opaque display and one or more imaging sensors for capturing video and/or images of a physical setting. Such video and/or images may be representations of the physical setting, for example. The video and/or images are combined with virtual objects, wherein the combination is then displayed on the opaque display. The physical setting may be viewed by a person, indirectly, via the images and/or video of the physical setting. The person may thus observe the virtual objects superimposed over the physical setting. When a system captures images of a physical setting, and displays an AR setting on an opaque display using the captured images, the displayed images are called a video pass-through. Alternatively, a transparent or semi-transparent display may be included in an electronic system for displaying an AR setting, such that an individual may view the physical setting directly through the transparent or semi-transparent displays. Virtual objects may be displayed on the semi-transparent or transparent display, such that an individual observes virtual objects superimposed over a physical setting. In yet another example, a projection system may be utilized in order to project virtual objects onto a physical setting. For example, virtual objects may be projected on a physical surface, or as a holograph, such that an individual observes the virtual objects superimposed over the physical setting.

An AR setting also may refer to an enhanced setting in which a representation of a physical setting is modified by computer-produced sensory data. As an example, at least a portion of a representation of a physical setting may be graphically modified (e.g., enlarged), so that the modified portion is still representative of (although not a fully-reproduced version of) the originally captured image(s). Alternatively, in providing video pass-through, one or more sensor images may be modified in order to impose a specific viewpoint different than a viewpoint captured by the image sensor(s). As another example, portions of a representation of a physical setting may be altered by graphically obscuring or excluding the portions.

Augmented virtuality (AV) is another example of MR. An AV setting refers to an enhanced setting in which a virtual or computer-produced setting integrates one or more sensory inputs from a physical setting. Such sensory input(s) may include representations of one or more characteristics of a physical setting. A virtual object may, for example, incorporate a color associated with a physical element captured by imaging sensor(s). Alternatively, a virtual object may adopt characteristics consistent with, for example, current weather conditions corresponding to a physical setting, such as weather conditions identified via imaging, online weather information, and/or weather-related sensors. As another example, an AR park may include virtual structures, plants, and trees, although animals within the AR park setting may include features accurately reproduced from images of physical animals.

Various systems allow persons to sense and/or interact with ER/CGR settings. For example, a head mounted system may include one or more speakers and an opaque display. As another example, an external display (e.g., a smartphone) may be incorporated within a head mounted system. The head mounted system may include microphones for capturing audio of a physical setting, and/or image sensors for capturing images/video of the physical setting. A transparent or semi-transparent display may also be included in the head mounted system. The semi-transparent or transparent display may, for example, include a substrate through which light (representative of images) is directed to a person's eyes. The display may also incorporate LEDs, OLEDs, liquid crystal on silicon, a laser scanning light source, a digital light projector, or any combination thereof. The substrate through which light is transmitted may be an optical reflector, holographic substrate, light waveguide, optical combiner, or any combination thereof. The transparent or semi-transparent display may, for example, transition selectively between a transparent/semi-transparent state and an opaque state. As another example, the electronic system may be a projection-based system. In a projection-based system, retinal projection may be used to project images onto a person's retina. Alternatively, a projection-based system also may project virtual objects into a physical setting, for example, such as projecting virtual objects as a holograph or onto a physical surface. Other examples of ER/CGR systems include windows configured to display graphics, headphones, earphones, speaker arrangements, lenses configured to display graphics, heads up displays, automotive windshields configured to display graphics, input mechanisms (e.g., controllers with or without haptic functionality), desktop or laptop computers, tablets, or smartphones.

FIG. 1 is a block diagram of an example of a portable multifunction device 100 (sometimes also referred to herein as the “electronic device 100” for the sake of brevity) in accordance with some implementations. The electronic device 100 includes memory 102 (which optionally includes one or more computer readable storage mediums), a memory controller 122, one or more processing units (CPUs) 120, a peripherals interface 118, an input/output (I/O) subsystem 106, a speaker 111, a touch-sensitive display system 112, an inertial measurement unit (IMU) 130, image sensor(s) 143 (e.g., camera), contact intensity sensor(s) 165, audio sensor(s) 113 (e.g., microphone), eye tracking sensor(s) 164 (e.g., included within a head-mountable device (HMD)), a hand tracking sensor 150, a body pose sensor 190, and other input or control device(s) 116. In some implementations, the electronic device 100 corresponds to one of a mobile phone, tablet, laptop, wearable computing device, head-mountable device (HMD), head-mountable enclosure (e.g., the electronic device 100 slides into or otherwise attaches to a head-mountable enclosure), or the like. In some implementations, the head-mountable enclosure is shaped to form a receptacle for receiving the electronic device 100 with a display.

In some implementations, the peripherals interface 118, the one or more processing units 120, and the memory controller 122 are, optionally, implemented on a single chip, such as a chip 103. In some other implementations, they are, optionally, implemented on separate chips.

The I/O subsystem 106 couples input/output peripherals on the electronic device 100, such as the touch-sensitive display system 112 and the other input or control devices 116, with the peripherals interface 118. The I/O subsystem 106 optionally includes a display controller 156, an image sensor controller 158, an intensity sensor controller 159, an audio controller 157, an eye tracking controller 162, one or more input controllers 160 for other input or control devices, an IMU controller 132, a hand tracking controller 180, a body pose controller 195, and a privacy subsystem 170. The one or more input controllers 160 receive/send electrical signals from/to the other input or control devices 116. The other input or control devices 116 optionally include physical buttons (e.g., push buttons, rocker buttons, etc.), dials, slider switches, joysticks, click wheels, and so forth. In some alternate implementations, the one or more input controllers 160 are, optionally, coupled with any (or none) of the following: a keyboard, infrared port, Universal Serial Bus (USB) port, stylus, and/or a pointer device such as a mouse. The one or more buttons optionally include an up/down button for volume control of the speaker 111 and/or audio sensor(s) 113. The one or more buttons optionally include a push button. In some implementations, the other input or control devices 116 include a positional system (e.g., GPS) that obtains information concerning the location and/or orientation of the electronic device 100 relative to a physical environment.

The touch-sensitive display system 112 provides an input interface and an output interface between the electronic device 100 and a user. The display controller 156 receives and/or sends electrical signals from/to the touch-sensitive display system 112. The touch-sensitive display system 112 displays visual output to the user. The visual output optionally includes graphics, text, icons, video, and any combination thereof (collectively termed “graphics”). In some implementations, some or all of the visual output corresponds to user interface objects. As used herein, the term “affordance” refers to a user-interactive graphical user interface object (e.g., a graphical user interface object that is configured to respond to inputs directed toward the graphical user interface object). Examples of user-interactive graphical user interface objects include, without limitation, a button, slider, icon, selectable menu item, switch, hyperlink, or other user interface control.

The touch-sensitive display system 112 has a touch-sensitive surface, sensor, or set of sensors that accepts input from the user based on haptic and/or tactile contact. The touch-sensitive display system 112 and the display controller 156 (along with any associated modules and/or sets of instructions in the memory 102) detect contact (and any movement or breaking of the contact) on the touch-sensitive display system 112 and convert the detected contact into interaction with user-interface objects (e.g., one or more soft keys, icons, web pages or images) that are displayed on the touch-sensitive display system 112. In an example implementation, a point of contact between the touch-sensitive display system 112 and the user corresponds to a finger of the user or a stylus.

The touch-sensitive display system 112 optionally uses LCD (liquid crystal display) technology, LPD (light emitting polymer display) technology, or LED (light emitting diode) technology, although other display technologies are used in other implementations. The touch-sensitive display system 112 and the display controller 156 optionally detect contact and any movement or breaking thereof using any of a plurality of touch sensing technologies now known or later developed, including but not limited to capacitive, resistive, infrared, and surface acoustic wave technologies, as well as other proximity sensor arrays or other elements for determining one or more points of contact with the touch-sensitive display system 112.

The user optionally makes contact with the touch-sensitive display system 112 using any suitable object or appendage, such as a stylus, a finger, and so forth. In some implementations, the user interface is designed to work with finger-based contacts and gestures, which can be less precise than stylus-based input due to the larger area of contact of a finger on the touch screen. In some implementations, the electronic device 100 translates the rough finger-based input into a precise pointer/cursor position or command for performing the actions desired by the user.

The speaker 111 and the audio sensor(s) 113 provide an audio interface between a user and the electronic device 100. Audio circuitry receives audio data from the peripherals interface 118, converts the audio data to an electrical signal, and transmits the electrical signal to the speaker 111. The speaker 111 converts the electrical signal to human-audible sound waves. Audio circuitry also receives electrical signals converted by the audio sensors 113 (e.g., a microphone) from sound waves. Audio circuitry converts the electrical signal to audio data and transmits the audio data to the peripherals interface 118 for processing. Audio data is, optionally, retrieved from and/or transmitted to the memory 102 and/or RF circuitry by the peripherals interface 118. In some implementations, audio circuitry also includes a headset jack. The headset jack provides an interface between audio circuitry and removable audio input/output peripherals, such as output-only headphones or a headset with both output (e.g., a headphone for one or both ears) and input (e.g., a microphone).

The inertial measurement unit (IMU) 130 includes accelerometers, gyroscopes, and/or magnetometers in order to measure various forces, angular rates, and/or magnetic field information with respect to the electronic device 100. Accordingly, according to various implementations, the IMU detects one or more positional change inputs of the electronic device 100, such as the electronic device 100 being shaken, rotated, moved in a particular direction, and/or the like.
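For purely explanatory purposes, the following Python sketch illustrates one way a positional change input such as a shake could be detected from IMU accelerometer samples; the function name, threshold, and units are illustrative assumptions and are not part of the IMU 130 itself.

    def detect_shake(acceleration_samples, threshold=15.0):
        # Illustrative only: a shake is assumed when the acceleration magnitude
        # of any sample exceeds a threshold (units of m/s^2 are assumed here).
        magnitudes = [(x * x + y * y + z * z) ** 0.5 for x, y, z in acceleration_samples]
        return any(m > threshold for m in magnitudes)

    print(detect_shake([(0.1, 0.2, 9.8), (12.0, 9.0, 11.0)]))  # True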

The image sensor(s) 143 capture still images and/or video. In some implementations, an image sensor 143 is located on the back of the electronic device 100, opposite a touch screen on the front of the electronic device 100, so that the touch screen is enabled for use as a viewfinder for still and/or video image acquisition. In some implementations, another image sensor 143 is located on the front of the electronic device 100 so that the user's image is obtained (e.g., for selfies, for videoconferencing while the user views the other video conference participants on the touch screen, etc.). In some implementations, the image sensor(s) are integrated within a HMD.

The contact intensity sensors 165 detect intensity of contacts on the electronic device 100 (e.g., a touch input on a touch-sensitive surface of the electronic device 100). The contact intensity sensors 165 are coupled with the intensity sensor controller 159 in the I/O subsystem 106. The contact intensity sensor(s) 165 optionally include one or more piezoresistive strain gauges, capacitive force sensors, electric force sensors, piezoelectric force sensors, optical force sensors, capacitive touch-sensitive surfaces, or other intensity sensors (e.g., sensors used to measure the force (or pressure) of a contact on a touch-sensitive surface). The contact intensity sensor(s) 165 receive contact intensity information (e.g., pressure information or a proxy for pressure information) from the physical environment. In some implementations, at least one contact intensity sensor 165 is collocated with, or proximate to, a touch-sensitive surface of the electronic device 100. In some implementations, at least one contact intensity sensor 165 is located on the back of the electronic device 100.

The eye tracking sensor(s) 164 detect eye gaze of a user of the electronic device 100 and generate eye tracking data indicative of the eye gaze of the user. In various implementations, the eye tracking data includes data indicative of a fixation point (e.g., point of regard) of the user on a display panel, such as a display panel within a head-mountable device (HMD), a head-mountable enclosure, or within a heads-up display.

The hand tracking sensor 150 obtains hand tracking data indicative of a position of a hand of a user. In various implementations, the electronic device 100 utilizes the hand tracking data in order to manipulate display of a diorama-view representation of an ER environment. For example, in some implementations, the electronic device 100 moves the diorama-view representation in order to track the current position of the hand of the user. As another example, the electronic device

The body pose sensor 190 obtains body pose data indicative of a position of a head or body of a user. In various implementations, the electronic device 100 utilizes the body pose data in order to manipulate display of a diorama-view representation of an ER environment. For example, the body pose data indicates the user is turning his/her head sideways, and the electronic device 100 accordingly manipulates the perspective view of the diorama-view representation of the ER environment.

In various implementations, the electronic device 100 includes a privacy subsystem 170 that includes one or more privacy setting filters associated with user information, such as user information included in the eye gaze data and/or body position data associated with a user. In some implementations, the privacy subsystem 170 selectively prevents and/or limits the electronic device 100 or portions thereof from obtaining and/or transmitting the user information. To this end, the privacy subsystem 170 receives user preferences and/or selections from the user in response to prompting the user for the same. In some implementations, the privacy subsystem 170 prevents the electronic device 100 from obtaining and/or transmitting the user information unless and until the privacy subsystem 170 obtains informed consent from the user. In some implementations, the privacy subsystem 170 anonymizes (e.g., scrambles or obscures) certain types of user information. For example, the privacy subsystem 170 receives user inputs designating which types of user information the privacy subsystem 170 anonymizes. As another example, the privacy subsystem 170 anonymizes certain types of user information likely to include sensitive and/or identifying information, independent of user designation (e.g., automatically). As yet another example, the privacy subsystem 170 denies access to a first individual having a first access level that does not satisfy an access level criterion that is associated with an ER session, and allows access to a second individual having a second access level that satisfies the access level criterion.
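For purely explanatory purposes, the following Python sketch illustrates how a privacy setting filter of the kind described above might gate access by access level and anonymize designated fields of user information; the class name, field names, and hashing choice are illustrative assumptions rather than a description of the privacy subsystem 170 itself.

    import hashlib

    class PrivacyFilter:
        # Illustrative sketch: gate access by an access level criterion and
        # anonymize (scramble) selected user-information fields.
        def __init__(self, required_access_level, fields_to_anonymize):
            self.required_access_level = required_access_level
            self.fields_to_anonymize = set(fields_to_anonymize)

        def allows(self, access_level):
            # Deny individuals whose access level does not satisfy the
            # access level criterion associated with the ER session.
            return access_level >= self.required_access_level

        def anonymize(self, user_info):
            # Hash fields likely to contain identifying data; pass others through.
            result = {}
            for key, value in user_info.items():
                if key in self.fields_to_anonymize:
                    result[key] = hashlib.sha256(str(value).encode()).hexdigest()[:12]
                else:
                    result[key] = value
            return result

    # Example: gaze data passes through only for sufficiently privileged users.
    privacy = PrivacyFilter(required_access_level=2, fields_to_anonymize={"user_id", "gaze_point"})
    if privacy.allows(access_level=2):
        print(privacy.anonymize({"user_id": "alice", "gaze_point": (0.4, 0.7)}))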

FIGS. 2A-2V are examples of an operating environment 200 in accordance with some implementations. While pertinent features are shown, those of ordinary skill in the art will appreciate from the present disclosure that various other features have not been illustrated for the sake of brevity and so as not to obscure more pertinent aspects of the example implementations disclosed herein.

To that end, as illustrated in FIG. 2A, the operating environment 200 includes an electronic device 203, which is being held by a user 210, and a physical table 220. In some implementations, the electronic device 203 corresponds to a mobile device, such as a smartphone, laptop, tablet, etc. In some implementations, the electronic device 203 is similar to and adapted from the electronic device 100 in FIG. 1. The electronic device 203 is associated with a field-of-view 204 associated with a portion of the operating environment 200.

According to various implementations, the electronic device 203 is configured to present a home ER environment 206 to the user 210. The home ER environment 206 is characterized by home ER world coordinates 208. In some implementations, the home ER world coordinates 208 define a set of points existing in a three-dimensional (3D) space. In some implementations, the home ER environment 206 corresponds to a purely virtual environment, and thus the electronic device 203 does not display the physical table 220 within the home ER environment 206, irrespective of the position of the electronic device 203 relative to the operating environment 200.

In some implementations, the home ER environment 206 corresponds to an AR environment. For example, in some implementations, the electronic device 203 is configured to present the home ER environment 206 and to enable video pass-through of at least a portion of physical objects within the operating environment 200 therein. As one example, with reference to FIG. 2F, after the user 210 wearing an electronic device 203 has rotated the orientation of the electronic device 203 towards the physical table 220, the electronic device 203 displays the physical table 220 within the home ER environment 206. As another example, in some implementations, the electronic device 203 includes a transparent or additive display that enables optical see-through of the operating environment 200 including physical objects within the operating environment 200.

The home ER environment 206 includes a first diorama-view representation 230 of a first ER environment and a second diorama-view representation 240 of a second ER environment. In some implementations, a diorama-view representation is representative of a purely virtual environment. On the other hand, in some implementations, a diorama-view representation is at least partially representative of a real environment. For example, in some implementations, a diorama-view representation includes structural features of a real environment, such as the walls, floor, and ceiling of a user's living room. As another example, in some implementations, a diorama-view representation includes physical objects that are within virtual structural features, such as a real table inside a virtual room.

The first diorama-view representation 230 includes one or more ER objects arranged in a spatial relationship according to first ER world coordinates 232. For example, the first ER world coordinates 232 define a respective set of points existing in a 3D space. Namely, the first diorama-view representation 230 includes a television 234, an avatar 236, and a couch 238. The avatar 236 corresponds to a virtual representation of an individual that is associated with (e.g., connected to) the first ER environment. For example, in some implementations, the avatar 236 is associated with an ER session that enables graphical representations of individuals to be displayed within the first ER environment. In some implementations, the electronic device 203 dynamically updates the first diorama-view representation 230, such as playback of video content via the television 234 or movement of the avatar 236, in order to reflect changes within the first ER environment.

The second diorama-view representation 240 includes one or more ER objects arranged in a spatial relationship according to second ER world coordinates 242. Namely, the second diorama-view representation 240 includes a credenza 244 and a chair 246. For example, the second ER world coordinates 242 define a respective set of points existing in a 3D space. One of ordinary skill in the art will appreciate that the diorama-view representations may include any number and types of ER objects arranged in a variety of ways. One of ordinary skill in the art will appreciate that the diorama-view representations may include any number of avatars.
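For purely explanatory purposes, the following Python sketch models a diorama-view representation as ER objects whose positions are expressed in the representation's own ER world coordinates, with the representation itself placed within the home ER world coordinates; the class names, fields, and example values are illustrative assumptions and do not limit the representations described above.

    from dataclasses import dataclass, field
    from typing import List, Tuple

    Vec3 = Tuple[float, float, float]

    @dataclass
    class ERObject:
        label: str                       # e.g., "television", "avatar", "couch"
        position: Vec3                   # position in the diorama's ER world coordinates
        yaw_degrees: float = 0.0         # orientation about the diorama's z-axis

    @dataclass
    class DioramaViewRepresentation:
        # ER objects arranged in a spatial relationship according to the
        # representation's own set of ER world coordinates.
        objects: List[ERObject] = field(default_factory=list)
        # Placement of the representation within the home ER world coordinates.
        origin_in_home: Vec3 = (0.0, 0.0, 0.0)
        yaw_in_home_degrees: float = 0.0
        scale: float = 0.1               # dioramas are miniature relative to true scale

    # Illustrative values for the first diorama-view representation 230.
    first_diorama = DioramaViewRepresentation(
        objects=[
            ERObject("television", (0.0, 1.0, 0.5), 180.0),
            ERObject("avatar", (1.0, 0.5, 0.0)),
            ERObject("couch", (2.0, 1.0, 0.0)),
        ],
        origin_in_home=(0.5, 0.5, 0.0),
    )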

In some implementations, the electronic device 203 corresponds to a head-mountable device (HMD) being worn by the user 210. The HMD presents the home ER environment 206 described above with reference to FIG. 2A. In some implementations, the HMD includes an image sensor that is associated with the field-of-view 204, and the HMD displays, within the home ER environment 206, AR content that includes physical objects. In some implementations, the HMD is similar to and adapted from the electronic device 100 in FIG. 1. In some implementations, the HMD is configured to present the home ER environment 206 and to enable video pass-through of at least a portion of physical objects within the operating environment 200 therein. As another example, in some implementations, the HMD includes a transparent or additive display that enables optical see-through of the operating environment 200 including physical objects within the operating environment 200.

In some implementations, the HMD includes an integrated display (e.g., a built-in display) that displays the home ER environment 206. In some implementations, the HMD includes a head-mountable enclosure. In various implementations, the head-mountable enclosure includes an attachment region to which another device with a display can be attached. In various implementations, the head-mountable enclosure is shaped to form a receptacle for receiving another device that includes a display (e.g., the electronic device 203 illustrated in FIG. 2A). For example, in some implementations, the electronic device 203 slides/snaps into or otherwise attaches to the head-mountable enclosure. In some implementations, the display of the device attached to the head-mountable enclosure presents (e.g., displays) the home ER environment 206.

FIGS. 2B-2H illustrate changing respective perspective views of the first diorama-view representation 230 and the second diorama-view representation 240 based on various user inputs. As illustrated in FIG. 2B, the electronic device 203 detects an input 250 associated with the second diorama-view representation 240. The input 250 requests to move the second diorama-view representation 240 upward along the z-axis of the home ER world coordinates 208 to a position above the first diorama-view representation 230. For example, in some implementations, an eye tracking sensor of (e.g., integrated within) the electronic device 203 detects the input 250, wherein the input 250 corresponds to an eye gaze input that indicates that an eye gaze of the user 210 is directed towards the second diorama-view representation 240. As another example, in some implementations, a hand tracking sensor of (e.g., integrated within) the electronic device 203 detects the input 250, wherein the input 250 corresponds to a hand tracking motion made by the user 210 that is directed towards the second diorama-view representation 240.

In response to detecting the input 250 in FIG. 2B, the electronic device 203 changes display of the second diorama-view representation 240 from a first viewing vector to a second viewing vector by moving the second diorama-view representation 240 to a position above the first diorama-view representation 230, as illustrated in FIG. 2C. A viewing vector provides one or more of a field-of-view (FOV), pose/rotational coordinates, translational coordinates, perspective, and/or the like relative to the user 210 and the home ER coordinate system 208. While changing display of the second diorama-view representation 240 from the first viewing vector to the second viewing vector, the electronic device 203 maintains the credenza 244 and the chair 246 according to the second ER world coordinates 242. In other words, the electronic device 203 maintains the spatial relationship between the credenza 244 and the chair 246 within the second diorama-view representation 240. For example, as illustrated in FIGS. 2B and 2C, the longer edge of the credenza 244 runs substantially along (e.g., parallel to) the y-axis of the second ER world coordinates 242. As another example, as illustrated in FIGS. 2B and 2C, the bottom of the legs of the credenza 244 and the bottom of the legs of the chair 246 are at the z=0 location of the z-axis of the second ER world coordinates 242. As yet another example, the electronic device 203 maintains the respective positions of the credenza 244 and the chair 246 relative to each other along the x-y plane of the second ER world coordinates 242 in FIGS. 2B and 2C.
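For purely explanatory purposes, the following Python sketch illustrates how changing a viewing vector can update only the representation's placement relative to the home ER world coordinates 208, while object positions stored in the representation's own ER world coordinates, and hence their spatial relationship, are maintained; the dictionary layout and values are illustrative assumptions.

    def change_viewing_vector(placement, new_position=None, new_yaw_degrees=None):
        # Update only the diorama's placement (viewing vector) within the home
        # ER world coordinates; the object positions expressed in the diorama's
        # own ER world coordinates are untouched.
        if new_position is not None:
            placement["position_in_home"] = new_position
        if new_yaw_degrees is not None:
            placement["yaw_in_home_degrees"] = new_yaw_degrees
        return placement

    # Illustrative stand-in for the second diorama-view representation 240.
    second_diorama = {
        "objects": {"credenza": (0.0, 0.4, 0.0), "chair": (0.6, 0.2, 0.0)},
        "position_in_home": (1.0, 0.5, 0.0),
        "yaw_in_home_degrees": 0.0,
    }

    # Input 250: move the diorama upward along the z-axis of the home ER world
    # coordinates; the credenza and chair keep the same relative arrangement.
    change_viewing_vector(second_diorama, new_position=(1.0, 0.5, 1.2))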

As illustrated in FIG. 2D, the electronic device 203 detects a movement input 251, such as via an IMU of the electronic device 203. The movement input 251 corresponds to the user 210 moving clockwise (as viewed from above) around the first diorama-view representation 230 and the second diorama-view representation 240 within the operating environment 200. In response to detecting the movement input 251 in FIG. 2D, the field-of-view 204 associated with the electronic device 203 and thus the displayed home ER environment 206 includes a portion of the physical table 220, as is illustrated in FIG. 2E.

In further response to detecting the movement input 251 in FIG. 2D, the electronic device 203 changes display of the first diorama-view representation 230 and the second diorama-view representation 240 according to the rotation of the home ER environment 206, as illustrated in FIG. 2E. Namely, the electronic device 203 rotates the first diorama-view representation 230 and the ER objects therein (e.g., the television 234, the avatar 236, and the couch 238) 90 degrees according to the corresponding rotation of the home ER world coordinates 208, as indicated by rotation of the first ER world coordinates 232 between FIG. 2D and FIG. 2E. For example, the view of the avatar 236 changes from a front-view of the avatar 236 in FIG. 2D to a side-view of the avatar 236 in FIG. 2E. While rotating the first diorama-view representation 230, the electronic device 203 maintains the television 234, the avatar 236, and the couch 238 according to the first ER world coordinates 232. In other words, the electronic device 203 maintains the spatial relationship between the television 234, the avatar 236, and the couch 238 within the first diorama-view representation 230.

Moreover, as illustrated in FIG. 2E, the electronic device 203 rotates the second diorama-view representation 240 and the ER objects therein (e.g., the credenza 244 and the chair 246) 90 degrees according to the corresponding rotation of the home ER world coordinates 208, as indicated by rotation of the second ER world coordinates 242 between FIG. 2D and FIG. 2E. For example, the view of the credenza 244 changes from a front-view of the credenza 244 in FIG. 2D to a side-view of the credenza 244 in FIG. 2E. While rotating the second diorama-view representation 240, the electronic device 203 maintains the credenza 244 and the chair 246 according to the second ER world coordinates 242. In other words, the electronic device 203 maintains the spatial relationship between the credenza 244 and the chair 246 within the second diorama-view representation 240.

As illustrated in FIG. 2F, the electronic device 203 detects a selection input 252 directed to the first diorama-view representation 230. For example, the selection input 252 is detected by an eye tracking sensor of the electronic device 203 or by a hand tracking sensor of the electronic device 203. In response to detecting the selection input 252 in FIG. 2F, the electronic device 203 ceases to display the second diorama-view representation 240 and maintains display of the first diorama-view representation 230, as illustrated in FIG. 2G.

As illustrated in FIG. 2H, the electronic device 203 detects a manipulation input 253 directed to the first diorama-view representation 230. The manipulation input 253 rotates the first diorama-view representation 230 90 degrees counterclockwise (as viewed from above) relative to the home ER world coordinates 208. Accordingly, as illustrated in FIGS. 2H and 2I, the first ER world coordinates 232 associated with the first diorama-view representation 230 rotate according to the manipulation input 253, whereas the home ER world coordinates 208 do not change. Accordingly, the respective views of the television 234, the avatar 236, and the couch 238 change from a side-view to a front-view between FIGS. 2H and 2I.
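For purely explanatory purposes, the following Python sketch illustrates the manipulation input 253 as a rotation of the first ER world coordinates 232 relative to the unchanged home ER world coordinates 208: positions expressed in the diorama's own coordinates are untouched, and only their mapping into home coordinates changes; the function names and values are illustrative assumptions.

    import math

    def rotate_diorama_frame(yaw_in_home_degrees, delta_degrees):
        # Rotate the diorama's ER world coordinates relative to the home ER
        # world coordinates, which stay fixed; positions expressed in the
        # diorama's own coordinates are unchanged.
        return (yaw_in_home_degrees + delta_degrees) % 360.0

    def to_home_coordinates(local_xy, yaw_in_home_degrees):
        # Express a diorama-local (x, y) point in home ER world coordinates
        # (translation omitted for brevity).
        yaw = math.radians(yaw_in_home_degrees)
        x, y = local_xy
        return (x * math.cos(yaw) - y * math.sin(yaw),
                x * math.sin(yaw) + y * math.cos(yaw))

    # Manipulation input 253: rotate the first diorama-view representation
    # 90 degrees counterclockwise, so a side-view of its objects becomes a front-view.
    yaw = rotate_diorama_frame(0.0, 90.0)
    print(to_home_coordinates((2.0, 1.0), yaw))  # approximately (-1.0, 2.0)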

FIGS. 2J-2N illustrate functionality associated with an ER session that is associated with the first ER environment. The ER session enables respective graphical representations of individuals to be concurrently within the first ER environment. For example, the ER session enables a particular individual that is represented by the avatar 236 to be within the first ER environment. In some implementations, the electronic device 203 receives a request to join an ER session and displays a corresponding indication. For example, in some implementations, the indication corresponds to an ER join interface 254, as illustrated in FIG. 2J. Moreover, as illustrated in FIG. 2J, the electronic device 203 plays (e.g., via a speaker) a first set of speech data 256 that is associated with the particular individual that is associated with (e.g., connected to) the ER session. For example, the first set of speech data 256 is a verbal request to join the ER session, such as “Hey, Bob. Come join my ER environment and we can virtually watch TV together.” As another example, the first set of speech data 256 includes ambient noise associated with the first ER environment, such as noise from the television 234.

As illustrated in FIGS. 2K and 2L, the electronic device 203 displays an animation of the avatar 236 based on a corresponding movement of the particular individual. As illustrated in FIG. 2K, the avatar 236 moves from standing beside the couch 238 to sitting on the couch 238 facing the television 234, as is indicated by movement arrow 257, which is illustrated for purely explanatory purposes. Accordingly, as illustrated in FIG. 2L, the electronic device 203 displays the first diorama-view representation 230 with the avatar 236 having moved from beside the couch 238 to sitting on the couch 238 and facing the television 234.

As illustrated in FIG. 2M, the electronic device 203 obtains (e.g., via an audio sensor) a second set of speech data 258 from the user 210 and provides the second set of speech data 258 to the particular individual. For example, the second set of speech data 258 is a verbal affirmation to join the ER session, such as “Hey, Jill. Sure, I will join your ER environment. I want to watch the baseball game on TV.”

FIGS. 2N-2V illustrate transforming a spatial relationship between a subset of the ER objects within the first diorama-view representation 230 as a function of the first ER world coordinates 232 and the home ER world coordinates 208. As illustrated in FIG. 2N, the first diorama-view representation 230 has a first height value indicated by a first reference measurement 260a, which is illustrated for purely explanatory purposes.

As illustrated in FIG. 2N, the electronic device 203 detects an acceptance input 259 that accepts the request to join the ER session. In response to detecting the acceptance input 259 in FIG. 2N, the electronic device 203: ceases to display the first diorama-view representation 230, transforms a subset of the ER objects within the first diorama-view representation 230 as a function of the first ER world coordinates 232 and the home ER world coordinates 208, and displays the subset of the ER objects based on the transformation, as illustrated in FIG. 2O. In some implementations, as illustrated in FIG. 2O, the electronic device 203 displays the subset of the ER objects within a bounded ER environment 261. As illustrated in FIG. 2O, the bounded ER environment 261 has a second height value indicated by a second reference measurement 260b, which is illustrated for purely explanatory purposes. Notably, as illustrated in FIG. 2O, the second height value indicated by the second reference measurement 260b is larger than the first height value indicated by the first reference measurement 260a. One of ordinary skill in the art will appreciate that other dimensional components (e.g., depth and width) of the bounded ER environment 261 are larger than corresponding dimensional components of the first diorama-view representation 230.

Displaying the bounded ER environment 261 may assist the user 210 in repositioning the subset of the ER objects in order to avoid physical objects and thus avoid visual obstructions, as will be described below. In some implementations, the electronic device 203 displays the subset of the ER objects without the bounded ER environment 261.

As illustrated in FIG. 2O, the subset of the ER objects includes a transformed television 264 that is based on the television 234, a transformed avatar 266 that is based on the avatar 236, and a transformed couch 268 that is based on the couch 238. The transformed television 264, the transformed avatar 266, and the transformed couch 268 are arranged according to transformed ER world coordinates 262 that are based on the first ER world coordinates 232. As compared with the first diorama-view representation 230 and the ER objects therein, the bounded ER environment 261 and the subset of the ER objects therein more accurately represent ER objects that may later be merged into or replace the home ER environment 206. For example, in some implementations, transforming the subset of the ER objects corresponds to enlarging each of the subset of the ER objects by a scaling factor. The scaling factor may be predetermined or may be specified by a user input. For example, in some implementations, the scaling factor corresponds to true-scale. In this example, the subset of the ER objects appears substantially equivalent to how they would appear after being merged into or replacing the home ER environment 206.
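For purely explanatory purposes, the following Python sketch illustrates one way the subset of the ER objects might be transformed as a function of the first ER world coordinates 232 and the home ER world coordinates 208, namely by enlarging each position by a scaling factor and offsetting it into the home coordinates so that the spatial relationship is preserved; the function name, positions, and scaling factor are illustrative assumptions.

    def transform_er_objects(local_positions, scaling_factor, origin_in_home):
        # Enlarge each position, expressed in the diorama's ER world coordinates,
        # by the scaling factor and offset it into the home ER world coordinates.
        transformed = {}
        for name, (x, y, z) in local_positions.items():
            transformed[name] = (origin_in_home[0] + scaling_factor * x,
                                 origin_in_home[1] + scaling_factor * y,
                                 origin_in_home[2] + scaling_factor * z)
        return transformed

    # Illustrative positions of the television 234, avatar 236, and couch 238
    # in the first ER world coordinates.
    diorama_objects = {"television": (0.0, 1.0, 0.5),
                       "avatar": (1.0, 0.5, 0.0),
                       "couch": (2.0, 1.0, 0.0)}

    # A scaling factor of 10 stands in for the enlargement from the miniature
    # diorama to the bounded ER environment 261; a true-scale factor would make
    # the objects appear as they would after merging into the home ER environment 206.
    bounded_environment_objects = transform_er_objects(diorama_objects, 10.0, (0.0, 3.0, 1.0))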

In some implementations, the electronic device 203 arranges the transformed television 264, the transformed avatar 266, and the transformed couch 268 in order to match (within an error threshold) the corresponding arrangement of the television 234, the avatar 236, and the couch 238 within the first diorama-view representation 230 as defined by the first ER world coordinates 232 and the home ER world coordinates 208. For example, as compared with the corresponding ER objects within the first diorama-view representation 230 illustrated in FIG. 2N, the subset of the ER objects (264, 266, 268) illustrated in FIG. 2O are similarly located and oriented relative to each other.

In some implementations, transformation of the subset of the ER objects results in the relative distances between the transformed television 264, the transformed avatar 266, and the transformed couch 268 matching relative distances of the corresponding ER objects within the first diorama-view representation 230 within an error threshold.
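For purely explanatory purposes, the following Python sketch checks that pairwise distances between the transformed ER objects, normalized here by the scaling factor, match the pairwise distances within the first diorama-view representation 230 within an error threshold; the function name and normalization choice are illustrative assumptions.

    import math

    def relative_distances_match(diorama_positions, transformed_positions,
                                 scaling_factor, error_threshold):
        # Compare each pairwise distance in the transformed arrangement,
        # normalized by the scaling factor, against the corresponding pairwise
        # distance in the diorama-view representation.
        names = sorted(diorama_positions)
        for i, a in enumerate(names):
            for b in names[i + 1:]:
                d_diorama = math.dist(diorama_positions[a], diorama_positions[b])
                d_transformed = math.dist(transformed_positions[a],
                                          transformed_positions[b]) / scaling_factor
                if abs(d_diorama - d_transformed) > error_threshold:
                    return False
        return True

    diorama = {"television": (0.0, 1.0, 0.5), "avatar": (1.0, 0.5, 0.0)}
    transformed = {"television": (0.0, 10.0, 5.0), "avatar": (10.0, 5.0, 0.0)}
    print(relative_distances_match(diorama, transformed, 10.0, 0.01))  # True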

In some implementations, transformation of the subset of the ER objects corresponds to changing resolutions of some or all of the subset of the ER objects. For example, in some implementations, the electronic device 203 performs object identification (e.g., semantic segmentation or object tracking) in order to identify dynamic ER objects (e.g., the avatar 236), and reduces resolutions of static ER objects (e.g., the couch 238).
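For purely explanatory purposes, the following Python sketch assigns a reduced resolution to static ER objects and full resolution to dynamic ER objects; the label set standing in for the object identification step (e.g., semantic segmentation) and the resolution values are illustrative assumptions.

    def assign_resolutions(object_labels, high_res=1.0, low_res=0.25):
        # Keep dynamic ER objects at full resolution and reduce the resolution
        # of static ER objects; the dynamic label set stands in for the output
        # of object identification.
        dynamic_labels = {"avatar"}
        return {label: (high_res if label in dynamic_labels else low_res)
                for label in object_labels}

    print(assign_resolutions(["television", "avatar", "couch"]))
    # {'television': 0.25, 'avatar': 1.0, 'couch': 0.25}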

In some implementations, the bounded ER environment 261 and the subset of the ER objects are obscured by one or more physical objects within the home ER environment 206. For example, as illustrated in FIG. 2O, the bounded ER environment 261 and the subset of the ER objects are initially displayed along a back wall of the home ER environment 206. Accordingly, were the user 210 to request to merge the subset of the ER objects into the home ER environment 206, the transformed television 264 would be behind and therefore blocked by the back wall, degrading the experience of the user 210. Accordingly, according to various implementations, the electronic device 203 enables movement of the subset of the ER objects in order to avoid physical obstructions.
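For purely explanatory purposes, the following Python sketch tests whether any ER object would obstruct (intersect) a physical object within the home ER environment 206 using axis-aligned bounding boxes, which is one simple way such an obstruction could be detected before the subset of the ER objects is moved; the function names and box values are illustrative assumptions.

    def boxes_overlap(box_a, box_b):
        # Axis-aligned bounding-box overlap test; each box is
        # ((min_x, min_y, min_z), (max_x, max_y, max_z)) in home ER world coordinates.
        (a_min, a_max), (b_min, b_max) = box_a, box_b
        return all(a_min[i] < b_max[i] and b_min[i] < a_max[i] for i in range(3))

    def obstructs_physical_object(er_object_boxes, physical_object_boxes):
        # True if any ER object would occlude (intersect) any physical object,
        # signaling that the bounded ER environment should be repositioned.
        return any(boxes_overlap(e, p)
                   for e in er_object_boxes
                   for p in physical_object_boxes)

    # Example: a box for the transformed couch 268 overlapping a box for the
    # physical table 220 (values are illustrative only).
    couch_box = ((0.0, 0.0, 0.0), (2.0, 1.0, 0.8))
    table_box = ((1.5, 0.5, 0.0), (2.5, 1.5, 0.9))
    print(obstructs_physical_object([couch_box], [table_box]))  # True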

For example, referring to FIG. 2P, the electronic device 203 detects an input 270 that corresponds to moving the bounded ER environment 261 and the subset of the ER objects therein away from the back wall of the home ER environment 206. In response to detecting the input 270 in FIG. 2P, the electronic device 203 moves the bounded ER environment 261 and the subset of the ER objects therein away from the back wall to above the physical table 220, as illustrated in FIG. 2Q. However, as illustrated in FIG. 2Q, the transformed avatar 266 and the transformed couch 268 obstruct a portion of the physical table 220.

As illustrated in FIG. 2R, the electronic device 203 detects an input 272 that corresponds to further moving the bounded ER environment 261 and the subset of the ER objects therein down along the z-axis of the home ER world coordinates 208 away from the physical table 220. In response to detecting the input 272 in FIG. 2R, the electronic device 203 moves the bounded ER environment 261 and the ER objects therein down along the z-axis of the home ER world coordinates 208. Accordingly, as illustrated in FIG. 2S, the transformed avatar 266 and the transformed couch 268 no longer obstruct the physical table 220, improving the experience of the user 210.

As illustrated in FIG. 2T, the electronic device 203 detects an input 274 directed to the bounded ER environment 261. The input 274 corresponds to a request to enter the first ER environment. In some implementations, in response to detecting the input 274 in FIG. 2T, the electronic device 203 adds a true-scale (e.g., full-size) television 282, a true-scale avatar 284, and a true-scale couch 286 to the home ER environment 206, as illustrated in FIG. 2U. Notably, as illustrated in FIG. 2U, the true-scale television 282 is within the field-of-view 204, but the true-scale avatar 284 and the true-scale couch 286 are not. However, were the user 210 to rotate the electronic device 203 clockwise 90 degrees (as viewed from above), such as by turning his or her head towards the true-scale avatar 284, then the true-scale avatar 284 and the true-scale couch 286 would be within the field-of-view 204.

In some implementations, adding the subset of the ER objects to the home ER environment 206 corresponds to resizing the subset of the ER objects to true-scale. As illustrated in FIG. 2U, the transformed avatar 266 and the transformed couch 268 do not obstruct the physical table 220. Accordingly, the user 210 may interact with the true-scale ER objects independent of providing additional inputs, leading to a more pleasant experience and reducing processor, memory, and battery utilization of the electronic device 203.

In some implementations, in response to detecting the input 274 in FIG. 2T, the electronic device 203 replaces the home ER environment 206 with the first ER environment 290, such as when the first ER environment 290 is a pure VR environment, as illustrated in FIG. 2V. The first ER environment 290 includes the true-scale television 282, the true-scale avatar 284, and the true-scale couch 286, but not the physical table 220.

FIG. 3 is a flow diagram of a method 300 of changing a perspective view of a diorama-view representation of an ER environment based on a user input in accordance with some implementations. In various implementations, the method 300 or portions thereof are performed by an electronic device (e.g., the electronic device 100 in FIG. 1 or the electronic device 203 in FIGS. 2A-2V). In various implementations, the method 300 or portions thereof are performed by an HMD. In some implementations, the method 300 is performed by processing logic, including hardware, firmware, software, or a combination thereof. In some implementations, the method 300 is performed by a processor executing code stored in a non-transitory computer-readable medium (e.g., a memory). In various implementations, some operations in method 300 are, optionally, combined and/or the order of some operations is, optionally, changed.

As represented by block 302, in some implementations, the method 300 includes generating a plurality of diorama-view representations respectively corresponding to a plurality of ER environments. In some implementations, the method 300 includes obtaining a plurality of characterization vectors that respectively provide a plurality of spatial characterizations of the corresponding plurality of ER environments. Each of the plurality of characterization vectors includes a plurality of object label values that respectively identify one or more ER objects. Each of the plurality of characterization vectors also includes a plurality of relative position values providing respective positions of the one or more ER objects relative to each other. Moreover, the method 300 includes generating, from the corresponding plurality of ER environments and the plurality of characterization vectors, the plurality of diorama-view representations of the corresponding plurality of ER environments according to the relative position values. For example, an object label value may include instance object label values and/or semantic object label values. As another example, an object label value may include one sub-value or multiple sub-values. As yet another example, a particular volumetric region of an ER environment may include three objects and thus has three corresponding object label values, such as “first object,” “second object,” and “third object.” An object label value may be associated with an object and/or a feature thereof (e.g., table, chair, corner, edge, etc.). Moreover, an object label value may identify a moving object, such as an avatar of a particular individual within the ER environment. In some implementations, the spatial characterization corresponds to a volumetric (e.g., three-dimensional (3D)) characterization.
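
For illustration only, a minimal Swift sketch of a characterization vector and of generating a diorama-view representation from it is shown below. The modeling choices (labels as strings, relative positions as offsets from a common origin) and the names CharacterizationVector, DioramaObject, and makeDioramaView are assumptions for this sketch rather than the disclosed data format.

    struct CharacterizationVector {
        let objectLabels: [String]             // object label values (instance and/or semantic)
        let relativePositions: [SIMD3<Float>]  // relative position values of the ER objects
    }

    struct DioramaObject {
        let label: String
        let position: SIMD3<Float>
    }

    func makeDioramaView(from vector: CharacterizationVector,
                         scalingFactor: Float) -> [DioramaObject] {
        // Preserve the relative positions while scaling them into the diorama's bounded region.
        zip(vector.objectLabels, vector.relativePositions).map { label, position in
            DioramaObject(label: label, position: position * scalingFactor)
        }
    }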

In some implementations, generating the plurality of diorama-view representations is based on a scaling factor, such as scaling down the corresponding plurality of ER environments by a scaling factor. For example, in some implementations, the scaling factor is predetermined. As another example, the scaling factor is obtained via a user input, such as an input that is directed to an affordance of a touch-sensitive surface of the electronic device or an input that corresponds to an eye gaze location of a user. As yet another example, in some implementations, the method 300 includes determining the scaling factor based on physical objects within a physical environment. To that end, in some implementations, the method 300 includes obtaining, via the image sensor, pass-through image data bounded by a field-of-view of a physical environment associated with the image sensor; identifying, within the pass-through image data, one or more physical objects within the physical environment; and determining the scaling factor based on the one or more physical objects. For example, the scaling factor may be based on respective positions and/or sizes (e.g., volumes) of the one or more physical objects, such as scaling down more when there are relatively large physical objects within the physical environment. In some implementations, the scaling factor is based on dimensions (e.g., volume) of a respective ER environment.
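
A hedged Swift sketch of one possible heuristic follows: the largest identified physical object's volume attenuates a base scaling factor, so the diorama is scaled down more when large physical objects are present. The heuristic, the default values, and the names (PhysicalObject, scalingFactor) are illustrative assumptions, not the disclosed algorithm.

    struct PhysicalObject {
        let boundingBoxSize: SIMD3<Float>   // width, height, depth in meters
        var volume: Float { boundingBoxSize.x * boundingBoxSize.y * boundingBoxSize.z }
    }

    func scalingFactor(for physicalObjects: [PhysicalObject],
                       baseFactor: Float = 0.1,
                       minimumFactor: Float = 0.02) -> Float {
        // The largest physical object drives how aggressively the ER environment is scaled down.
        let largestVolume = physicalObjects.map(\.volume).max() ?? 0
        let attenuation = 1.0 / (1.0 + largestVolume)   // shrink more for larger physical objects
        return max(minimumFactor, baseFactor * attenuation)
    }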

As represented by block 304, the method 300 includes displaying, via a display device, the plurality of diorama-view representations from a corresponding plurality of viewing vectors. The plurality of diorama-view representations corresponds to a plurality of enhanced reality (ER) environments. Each of the plurality of diorama-view representations is associated with a respective set of ER world coordinates that characterizes a respective ER environment. The plurality of diorama-view representations includes a first one of the plurality of diorama-view representations displayed from a first viewing vector. The first one of the plurality of diorama-view representations includes a first one or more of ER objects arranged according to a first set of ER world coordinates. As another example, each of the plurality of diorama-view representations corresponds to a bounded region associated with a respective set of ER world coordinates. The orientation, position, location, etc. of a particular plurality of ER objects within a corresponding diorama-view representation is based on the respective set of ER world coordinates. The ER objects may include a combination of movable inanimate objects (e.g., a chair or table), living objects (e.g., one or more avatars representing an individual), and inanimate structural objects (e.g., ceiling, wall, boundary of the diorama-view representation). In some implementations, the electronic device displays the plurality of diorama-view representations in accordance with determining satisfaction of a spatial proximity threshold. For example, diorama-view representations are spaced sufficiently far away from each other so as to avoid obscuring each other. A particular viewing vector provides one or more of: a field-of-view (FOV), pose/rotational coordinates, translational coordinates, a perspective, and/or the like.
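
A small Swift sketch of the spatial proximity check mentioned above is given for illustration, assuming each diorama-view representation is approximated by a bounding sphere; the names DioramaPlacement and satisfiesSpatialProximityThreshold and the bounding-sphere model are assumptions of this sketch.

    struct DioramaPlacement {
        let center: SIMD3<Float>   // position in home ER world coordinates
        let radius: Float          // extent of the diorama's bounded region
    }

    func satisfiesSpatialProximityThreshold(_ placements: [DioramaPlacement],
                                            minimumGap: Float) -> Bool {
        for i in 0..<placements.count {
            for j in (i + 1)..<placements.count {
                let offset = placements[i].center - placements[j].center
                let distance = (offset.x * offset.x + offset.y * offset.y + offset.z * offset.z).squareRoot()
                // Representations must be separated by both radii plus a minimum gap
                // so that they do not obscure each other.
                if distance < placements[i].radius + placements[j].radius + minimumGap {
                    return false
                }
            }
        }
        return true
    }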

In some implementations, the method 300 includes displaying the plurality of diorama-view representations within a home ER environment, such as the home ER environment 206 in FIGS. 2A-2V. As one example, with reference to FIG. 2A, the electronic device 203 displays a first diorama-view representation 230 of a first ER environment and a second diorama-view representation 240 of a second ER environment within the home ER environment 206. Continuing with this example, the first diorama-view representation 230 includes a television 234, an avatar 236, and a couch 238 arranged according to first ER world coordinates 232. Continuing with this example, the second diorama-view representation 240 includes a credenza 244 and a chair 246 arranged according to second ER world coordinates 242.

In some implementations, the method 300 includes displaying, within a particular one of the plurality of diorama-view representations, a recording of activity within a respective ER environment associated with the particular one of the plurality of diorama-view representations. In some implementations, the recording includes activity within the particular one of the plurality of diorama-view representations over a previous span of time. For example, the recording includes activity over the last day, indicating that five people entered and left a respective ER environment and three of those people drew on a whiteboard within the respective ER environment.

In some implementations, the electronic device displays the plurality of diorama-view representations based on control values. For example, the control values may be set via a user input. Control values may include, for example, a combination of whether or not to mute audio, whether a previewing user may communicate with individuals who have joined the first ER session, and/or whether the previewing user sees a history (e.g., log) of who has joined and left and when.

In some implementations, the electronic device displays the plurality of diorama-view representations based on eye gaze data. To that end, the method 300 includes obtaining eye gaze data indicative of an eye gaze location, wherein displaying the plurality of diorama-view representations is based on the eye gaze data. For example, an electronic device renders and displays a particular one of the plurality of diorama-view representations with high resolution (e.g., emphasized) when the eye gaze data indicates that the eye gaze location is closest to the particular one of the plurality of diorama-view representations. As another example, the electronic device does not display another one of the plurality of diorama-view representations when the eye gaze data indicates that the eye gaze location has not been within a threshold distance from the other one of the plurality of diorama-view representations for a threshold amount of time. As yet another example, the electronic device ceases performing live updates on (e.g., keeps static) the contents of a portion of the plurality of diorama-view representations that are not within a threshold distance from the eye gaze location.
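
For illustration only, a minimal Swift sketch of such gaze-driven display decisions follows; the three rendering modes, the threshold values, and the names (RenderingMode, renderingMode) are assumptions of this sketch rather than disclosed behavior.

    enum RenderingMode {
        case emphasized      // high resolution, live updates
        case staticContents  // displayed, but live updates paused
        case hidden          // not displayed
    }

    func renderingMode(distanceFromGaze: Float,
                       secondsSinceGazeWasNear: Float,
                       isClosestToGaze: Bool,
                       distanceThreshold: Float = 0.5,
                       staleSeconds: Float = 10.0) -> RenderingMode {
        if isClosestToGaze { return .emphasized }
        // Hide representations the gaze has not been near for a threshold amount of time.
        if distanceFromGaze > distanceThreshold && secondsSinceGazeWasNear > staleSeconds {
            return .hidden
        }
        // Otherwise keep the representation visible but pause live updates of its contents.
        return .staticContents
    }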

As represented by block 306, in some implementations, the plurality of diorama-view representations corresponds to reduced-sized representations of the corresponding plurality of ER environments. As one example, the television 234 within the first diorama-view representation 230 illustrated in FIG. 2N corresponds to a reduced-sized version of the true-scale television 282 illustrated in FIG. 2U. In some implementations, the number of ER objects within a particular diorama-view representation is the same as or fewer than the number of ER objects within a corresponding ER environment. For example, the method 300 includes selectively rendering and/or displaying certain objects, such as displaying living objects (e.g., the avatar 236), but not rendering structural features (e.g., walls) of the home ER environment. In some implementations, the method 300 includes rendering different features of a particular diorama-view representation with different levels of fidelity (e.g., resolution). By selectively rendering and/or displaying, an electronic device reduces utilization of processing and battery resources.

As represented by block 308, in some implementations, displaying the plurality of diorama-view representations includes animating a portion of the plurality of diorama-view representations. As one example, with reference to FIGS. 2K and 2L, the electronic device 203 displays movement of the avatar 236 because an individual associated with the avatar 236 is moving within his or her respective physical environment, causing a resulting movement of his or her avatar 236 within the ER environment.

As represented by block 310, in some implementations, at least a subset of the corresponding plurality of ER environments is respectively associated with a plurality of ER environments corresponding to a plurality of ER sessions. Each of the plurality of ER environments enables graphical representations of individuals to be concurrently within the ER environment. According to various implementations, each of the one or more corresponding individuals is associated with a respective access level that satisfies an access level criterion that is associated with the respective ER session, such as is managed by the privacy subsystem 170 in FIG. 1. Accordingly, for example, a particular individual who has no preexisting relationship with any of the individuals currently associated with an ER environment cannot view a corresponding diorama-view representation.

As represented by block 312, in some implementations, at least a subset of the plurality of diorama-view representations includes one or more ER representations (e.g., avatars) of one or more corresponding individuals. Each of the one or more corresponding individuals is associated with a respective ER session. As one example, with reference to FIG. 2A, the electronic device 203 displays an avatar 236 that is associated with an ER session, which, itself, is associated with the first ER environment. In some implementations, the method 300 includes providing two-way audio between an electronic device and an avatar within a diorama-view representation. To that end, the method 300 includes playing, via a speaker of the electronic device, a first set of speech data while displaying the plurality of diorama-view representations. The first set of speech data is associated with one or more corresponding individuals that are associated with a particular ER session associated with a respective ER environment. For example, the first set of speech data is uttered by the one or more corresponding individuals. As another example, the playback of the first set of speech data is in real time or is delayed via buffering. As yet another example, the first set of speech data may be spatialized so as to sound as though originating from within a diorama-view representation. Moreover, the method 300 includes obtaining, via an audio sensor of the electronic device, a second set of speech data from a user associated with the electronic device. Moreover, the method 300 includes providing the second set of speech data to the respective ER environment so that the second set of speech data is audible to the one or more corresponding individuals that are associated with the particular ER session. The second set of speech data may be presented to participants of respective ER environments in a variety of ways, such as spatialized audio from above the respective ER environments.
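
Purely as a sketch of the spatialization idea (distance attenuation plus a crude stereo pan), the following Swift function processes a block of speech samples so the audio appears to originate from the diorama's position; a production implementation would instead hand the source position to a spatial audio engine, and all names here are illustrative assumptions.

    func spatialize(samples: [Float],
                    sourcePosition: SIMD3<Float>,
                    listenerPosition: SIMD3<Float>) -> (left: [Float], right: [Float]) {
        let offset = sourcePosition - listenerPosition
        let distance = (offset.x * offset.x + offset.y * offset.y + offset.z * offset.z).squareRoot()
        let gain = 1.0 / (1.0 + distance * distance)   // farther dioramas sound quieter
        let pan = max(-1.0, min(1.0, offset.x))        // crude left/right placement
        let leftGain = gain * (1.0 - pan) / 2.0
        let rightGain = gain * (1.0 + pan) / 2.0
        return (samples.map { $0 * leftGain }, samples.map { $0 * rightGain })
    }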

As represented by block 314, in some implementations, the first one of the plurality of diorama-view representations corresponds to an augmented reality (AR) representation of a first ER environment. For example, the first one of the plurality of diorama-view representations includes AR content overlaid on environmental data that is associated with physical features of an operating environment. For example, with reference to FIG. 2A, the couch 238 within the first diorama-view representation 230 may correspond to a physical couch that is associated with the first ER environment. In some implementations, the method 300 includes obtaining (e.g., via an image sensor) the environmental data (e.g., pass-through image data), generating ER objects, and overlaying the ER objects on the environmental data.

As represented by block 316, in some implementations, the first one of the plurality of diorama-view representations corresponds to a virtual reality (VR) representation of a first ER environment. For example, the objects within the second diorama-view representation 240, including structural objects (e.g., the floor and walls), the credenza 244, and the chair 246, correspond to ER objects.

As represented by block 318, the method 300 includes detecting, via one or more input devices, an input associated with the first one of the plurality of diorama-view representations. As represented by block 320, in some implementations, the input is directed to the first one of the plurality of diorama-view representations. As one example, with reference to FIG. 2H, the manipulation input 253 is directed to the first diorama-view representation 230. For example, the input changes the orientation of (e.g., rotating) the first one of the plurality of diorama-view representations. As another example, the input moves the first one of the plurality of diorama-view representations along an axis, such as translating the first one of the plurality of diorama-view representations along an x-y axis.

As represented by block 322, in some implementations, the input corresponds to a change in position of the electronic device from a first pose to a second pose relative to the first one of the plurality of diorama-view representations. As represented by block 324, in some implementations, the input is directed to multiple diorama-view representations. For example, the electronic device includes an IMU that detects positional changes of the electronic device. As one example, with reference to FIG. 2D, the electronic device 203 detects the movement input 251 that corresponds to the user 210 moving around the first diorama-view representation 230 and the second diorama-view representation 240 within the operating environment 200. In response to detecting the movement input 251, the electronic device 203 changes the first diorama-view representation 230 and the second diorama-view representation 240 from a first pose to a second pose.

As represented by block 326, the method 300 includes, in response to detecting the input, changing display of the first one of the plurality of diorama-view representations from the first viewing vector to a second viewing vector while maintaining the first one or more ER objects arranged according to the first set of ER world coordinates. As one example, with reference to FIGS. 2D and 2E, in response to detecting the movement input 251, the electronic device 203 correspondingly changes respective viewing vectors associated with the first diorama-view representation 230 and the second diorama-view representation 240, as indicated by respective changes to the first ER world coordinates 232 and the second ER world coordinates 242. As another example, with reference to FIGS. 2H and 2I, in response to detecting the manipulation input 253 directed to the first diorama-view representation 230, the electronic device 203 correspondingly rotates the first diorama-view representation 230, as indicated by a change to the first ER world coordinates 232 relative to the home ER world coordinates 208.
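
For illustration only, a minimal Swift sketch of changing a viewing vector while maintaining the arrangement follows: the ER objects' positions in the diorama's own ER world coordinates are untouched, and only the transform placing those coordinates in the home ER world coordinates is rotated. The yaw-only frame model and all names are assumptions of this sketch, with the z-axis taken as vertical to match the convention above.

    import Foundation

    struct DioramaFrame {
        var originInHome: SIMD3<Float>   // diorama origin expressed in home ER world coordinates
        var yawRadians: Float            // rotation of the diorama frame about the vertical axis
    }

    struct PlacedObject {
        let localPosition: SIMD3<Float>  // fixed position in the diorama's own ER world coordinates
    }

    func homePosition(of object: PlacedObject, in frame: DioramaFrame) -> SIMD3<Float> {
        let c = Float(cos(Double(frame.yawRadians)))
        let s = Float(sin(Double(frame.yawRadians)))
        let p = object.localPosition
        // Rotate about the vertical (z) axis, then translate into home ER world coordinates.
        let rotated = SIMD3<Float>(c * p.x - s * p.y, s * p.x + c * p.y, p.z)
        return rotated + frame.originInHome
    }

    // A manipulation input changes only the frame; the objects' local positions,
    // and therefore their arrangement relative to one another, are maintained.
    func rotateDiorama(_ frame: inout DioramaFrame, byRadians delta: Float) {
        frame.yawRadians += delta
    }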

FIG. 4 is a flow diagram of a method 400 of transforming a spatial relationship between a subset of ER objects as a function of respective coordinates associated with a corresponding diorama-view representation and home ER world coordinates in accordance with some implementations. In various implementations, the method 400 or portions thereof are performed by an electronic device (e.g., the electronic device 100 in FIG. 1 or the electronic device 203 in FIGS. 2A-2V). In various implementations, the method 400 or portions thereof are performed by an HMD. In some implementations, the method 400 is performed by processing logic, including hardware, firmware, software, or a combination thereof. In some implementations, the method 400 is performed by a processor executing code stored in a non-transitory computer-readable medium (e.g., a memory). In various implementations, some operations in method 400 are, optionally, combined and/or the order of some operations is, optionally, changed.

As represented by block 402, in some implementations, the method 400 includes displaying a plurality of diorama-view representations of a corresponding plurality of ER environments within a home ER environment. The plurality of diorama-view representations includes a first diorama-view representation of a first ER environment. As one example, with reference to FIG. 2A, the electronic device 203 displays a first diorama-view representation 230 of a first ER environment and a second diorama-view representation 240 of a second ER environment within the home ER environment 206. Continuing with this example, the first diorama-view representation 230 includes a television 234, an avatar 236, and a couch 238 arranged according to first ER world coordinates 232. Continuing with this example, the second diorama-view representation 240 includes a credenza 244 and a chair 246 arranged according to second ER world coordinates 242.

In some implementations, as represented by block 404, the method 400 includes detecting a selection input that selects the first diorama-view representation from the plurality of diorama-view representations. As one example, with reference to FIG. 2F, the electronic device 203 detects the selection input 252 that is directed to the first diorama-view representation 230. In some implementations, as represented by block 406, the selection input is based on eye gaze data obtained by, for example, an eye tracking sensor integrated within an electronic device or HMD. For example, the eye gaze data indicates that an eye gaze location of a user is directed to the first diorama-view representation of the plurality of diorama-view representations.

As represented by block 408, the method 400 includes displaying a home ER environment characterized by home ER world coordinates, including the first diorama-view representation of the first ER environment. The first diorama-view representation includes one or more of ER objects arranged in a spatial relationship according to first ER world coordinates. In some implementations, displaying the first diorama-view representation includes ceasing to display the remainder of the plurality of diorama-view representations. In some implementations, displaying the first diorama-view representation includes displaying the remainder of the plurality of diorama-view representations with a lower resolution than the first diorama-view representation. In some implementations, displaying the first diorama-view representation includes ceasing to perform live updates of contents of the first diorama-view representation. In some implementations, the first diorama-view representation corresponds to a reduced-size representation of the first ER environment. As an example, the one or more ER objects includes a combination of movable objects (e.g., a chair or table), living objects (e.g., one or more avatars representing respective individuals), structural objects (e.g., ceiling, wall, boundary of the diorama-view representation), anchored objects (e.g., a bookshelf), and/or the like. The orientation, position, location, etc. of the one or more ER objects is based on the first ER world coordinates. In some implementations, as represented by block 410, the home ER environment includes one or more physical objects associated with the home ER world coordinates. As one example, with reference to FIG. 2A, the home ER environment 206 includes a physical table 220.

As represented by block 412, the method 400 includes detecting a first input that is directed to the first diorama-view representation. As one example, with reference to FIG. 2N, the acceptance input 259 accepts an invitation to join an ER session associated with the first diorama-view representation 230. For example, the first input is associated with an interest to initiate (e.g., join or display) the first ER environment, such as a voice input, picking up or putting down the first diorama-view representation (e.g., as detected via a hand sensor), etc.

As represented by block 414, the method 400 includes, in response to detecting the first input, transforming the home ER environment. According to some implementations, transforming the home ER environment includes ceasing to display the first diorama-view representation within the home ER environment and transforming the spatial relationship between a subset of the one or more ER objects as a function of the home ER world coordinates and the first ER world coordinates. As represented by block 416, the method 400 includes displaying the subset of the one or more ER objects within the home ER environment based on the transformation. As one example, in response to detecting the acceptance input 259 that selects the first diorama-view representation 230 in FIG. 2N, the electronic device 203 displays the bounded ER environment 261 including the transformed television 264, the transformed avatar 266, and the transformed couch 268, as illustrated in FIG. 2O. For example, the subset of the one or more ER objects corresponds to respective representations of the corresponding ER objects. As another example, the subset of the one or more ER objects may include some or all of the one or more ER objects. In some implementations, transforming the spatial relationship includes increasing respective sizes of the subset of the one or more ER objects as compared with the same objects within the first diorama-view representation. In some implementations, transforming the spatial relationship includes increasing distances between the subset of the one or more ER objects as compared with the same objects within the first diorama-view representation, while maintaining relative distances between the subset of the one or more ER objects. For example, in the first diorama-view representation an ER stapler is 1 inch from an ER table and 3 inches from an ER wall, and the transformation changes the distance between the ER stapler and the ER table to 2 inches and the distance between the ER stapler and the ER wall to 6 inches. In some implementations, transforming the spatial relationship includes moving the subset of the one or more ER objects relative to the home ER environment, such as dragging the subset of the one or more ER objects along an x-y axis. In some implementations, transforming the spatial relationship includes continuously increasing respective sizes of the subset of the one or more ER objects until true-scale representations of the subset of the one or more ER objects are achieved.
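
For illustration only, a minimal Swift sketch of such a uniform transformation follows: every object's offset from a common origin and its size are multiplied by a single factor, so absolute distances grow while relative distances are preserved (at a factor of 2, the 1 inch and 3 inch stapler distances in the example above become 2 inches and 6 inches). The names TransformableObject and transformSpatialRelationship, and the choice of a single scalar factor, are assumptions of this sketch.

    struct TransformableObject {
        var position: SIMD3<Float>   // position in the first ER world coordinates
        var size: Float              // representative dimension of the object
    }

    func transformSpatialRelationship(of objects: [TransformableObject],
                                      about origin: SIMD3<Float>,
                                      factor: Float) -> [TransformableObject] {
        objects.map { object in
            // Scale both the offset from the origin and the object's size by the same factor,
            // which preserves the relative distances between the ER objects.
            TransformableObject(position: origin + (object.position - origin) * factor,
                                size: object.size * factor)
        }
    }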

In some implementations, as represented by block 418, the method 400 includes, in response to detecting the first input, displaying the subset of the one or more ER objects from a first viewing vector. For example, the viewing vector provides one or more of: a field-of-view (FOV), pose/rotational coordinates, translational coordinates, a perspective, and/or the like.

In some implementations, as represented by block 420, the method 400 includes moving the subset of the one or more ER objects based on identified physical object(s). To that end, the method 400 includes obtaining, via an image sensor, environmental data (e.g., pass-through image data) bounded by a field-of-view associated with the image sensor, wherein the environmental data is associated with a physical environment including one or more physical objects. Moreover, the method 400 includes identifying, within the environmental data, a particular one of the one or more physical objects located within a spatial proximity threshold of the subset of the one or more ER objects and moving the subset of the one or more ER objects relative to the one or more physical objects based on the particular one of the one or more physical objects. For example, the method 400 includes identifying the particular one of the one or more physical objects via instance segmentation and/or semantic segmentation. As another example, the particular one of the one or more physical objects is larger than a volumetric threshold and thus is likely to obscure the subset of the one or more ER objects. In some implementations, the particular one of the one or more physical objects satisfies a spatial proximity threshold with respect to the subset of the one or more ER objects. For example, the spatial proximity threshold is satisfied when the particular one of the one or more physical objects is less than a threshold distance away from the subset of the one or more ER objects. In some implementations, the method 400 includes displaying an indicator that is indicative of the particular one of the one or more physical objects, such as a color overlay or animation that is on the surface of a portion of the particular one of the one or more physical objects (e.g., “do not move the ER environment downwards because there is a large wall there”).
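
A hedged Swift sketch of this step follows: a physical object that is both within a proximity threshold of the ER objects' centroid and larger than a volumetric threshold is found, and the ER objects are nudged directly away from it. Segmentation of the pass-through data is out of scope here, and the names (IdentifiedPhysicalObject, obstructingObject, moveAway) are illustrative assumptions.

    struct IdentifiedPhysicalObject {
        let label: String            // e.g., from instance or semantic segmentation
        let center: SIMD3<Float>
        let volume: Float
    }

    func obstructingObject(near erCentroid: SIMD3<Float>,
                           in physicalObjects: [IdentifiedPhysicalObject],
                           proximityThreshold: Float,
                           volumetricThreshold: Float) -> IdentifiedPhysicalObject? {
        physicalObjects.first { object in
            let offset = object.center - erCentroid
            let distance = (offset.x * offset.x + offset.y * offset.y + offset.z * offset.z).squareRoot()
            return distance < proximityThreshold && object.volume > volumetricThreshold
        }
    }

    func moveAway(erCentroid: SIMD3<Float>,
                  from obstruction: IdentifiedPhysicalObject,
                  by clearance: Float) -> SIMD3<Float> {
        let offset = erCentroid - obstruction.center
        let distance = (offset.x * offset.x + offset.y * offset.y + offset.z * offset.z).squareRoot()
        guard distance > 0 else { return erCentroid + SIMD3<Float>(clearance, 0, 0) }
        // Translate along the direction pointing away from the physical obstruction.
        return erCentroid + (offset / distance) * clearance
    }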

In some implementations, as represented by block 422, the method 400 includes detecting a second input. As one example, the second input is directed to the subset of the one or more ER objects. For example, the second input is directed to a separate affordance, such as pressing a left arrow in order to move a diorama-view representation leftwards. In some implementations, the second input has three degrees of freedom. For example, a diorama-view representation may be spun, rotated, and/or translated along an x-z plane, but not moved up or down in a y direction. In some implementations, the second input is detected via an inertial measurement unit (IMU), such as a gyroscope or accelerometer. As another example, the second input corresponds to a change in position of the electronic device from a first pose to a second pose relative to the subset of the one or more ER objects.

In some implementations, as represented by block 424, the second input is detected via a hand tracking sensor or an eye tracking sensor. To that end, in some implementations, the method 400 includes detecting the second input via the hand tracking sensor; obtaining hand tracking data from the hand tracking sensor based on the second input; and determining, from the hand tracking data, that the second input is directed to the subset of the one or more ER objects. For example, the method 400 includes comparing a hand position indicated by the hand tracking data against the position of the subset of the one or more ER objects. In some implementations, the method 400 includes detecting the second input via the eye tracking sensor; obtaining eye tracking data from the eye tracking sensor based on the second input; and determining, from the eye tracking data, that the second input is directed to the subset of the one or more ER objects. For example, the method 400 includes determining the second input is directed to the subset of the one or more ER objects by comparing an eye gaze position indicated by the eye gaze data against the position of the subset of the one or more ER objects.
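
For illustration only, the comparison described above can be sketched in Swift as a containment test against a bounding region of the subset of ER objects; the bounding-sphere model and the names (BoundingSphere, isDirected) are assumptions of this sketch. The same test can serve hand tracking data (e.g., a fingertip or pinch location) and eye tracking data (e.g., a gaze hit point projected into the scene).

    struct BoundingSphere {
        let center: SIMD3<Float>   // center of the subset of ER objects
        let radius: Float          // extent of the subset of ER objects
    }

    func isDirected(trackedPosition: SIMD3<Float>, at bounds: BoundingSphere) -> Bool {
        // The input is "directed to" the subset when the tracked position (hand or gaze)
        // falls within the subset's bounding region.
        let offset = trackedPosition - bounds.center
        let distance = (offset.x * offset.x + offset.y * offset.y + offset.z * offset.z).squareRoot()
        return distance <= bounds.radius
    }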

In some implementations, as represented by block 426, the method 400 includes, in response to detecting the second input, changing display of the subset of the one or more ER objects from the first viewing vector to a second viewing vector while maintaining the subset of the one or more ER objects arranged according to the first ER world coordinates. For example, changing display of the subset of the one or more ER objects from the first viewing vector to a second viewing vector may include a combination of rotating, flipping, or moving along a fixed axis with a nominal amount of rotation, etc. As one example, in response to detecting the input 272 in FIG. 2R, the electronic device 203 moves the transformed television 264, the transformed avatar 266, and the transformed couch 268 from above the physical table 220 downwards along the z-axis of the home ER world coordinates 208 to the side of the physical table 220 in FIG. 2S. Continuing with this example, the electronic device 203 maintains arrangement of the transformed television 264, the transformed avatar 266, and the transformed couch 268 according to the transformed ER world coordinates 262. In some implementations, as represented by block 428, changing display of the subset of the one or more ER objects from the first viewing vector to the second viewing vector is based on the change in position of the electronic device from a first pose to a second pose. In some implementations, as represented by block 430, changing display of the subset of the one or more ER objects from the first viewing vector to the second viewing vector includes moving the subset of the one or more ER objects relative to the one or more physical objects.

In some implementations, as represented by block 432, the method 400 includes detecting an input requesting to enter the first ER environment. As one example, with reference to FIG. 2T, the input 274 is directed to the bounded ER environment 261 and corresponds to a request to enter the bounded ER environment 261. For example, the input 274 corresponds to one of an eye gaze input, a hand gesture, a voice input, a touch input, and/or the like. As another example, the input 274 corresponds to placing or dropping a diorama-view representation on the ground in order to enable the diorama-view representation to expand to true-scale or in order to enable the diorama-view representation to replace the current environment.

In some implementations, as represented by block 434, the method 400 includes, in response to detecting the input requesting to enter the first ER environment, adding the subset of the one or more ER objects to the home ER environment. As one example, in response to detecting the input 274 in FIG. 2T, the electronic device 203 adds the transformed television 264, the transformed avatar 266, and the transformed couch 268 to the home ER environment 206 including the physical table 220, as illustrated in FIG. 2U. In some implementations, adding the subset of the one or more ER objects to the home ER environment includes overlaying the subset of the one or more ER objects on physical objects within the home ER environment. In some implementations, adding the subset of the one or more ER objects to the home ER environment includes skinning physical features (e.g., structural features, such as walls, floor) of the first ER environment and adding the physical features, in addition to adding the subset of the one or more ER objects, to the home ER environment.

In some implementations, as represented by block 436, the method 400 includes, in response to detecting the input requesting to enter the first ER environment, replacing the home ER environment with the first ER environment that includes the subset of the one or more ER objects. For example, replacing the home ER environment with the first ER environment corresponds to replacing the home ER environment with a purely VR environment. As one example, in response to detecting the input 274 in FIG. 2T, the electronic device 203 replaces the home ER environment 206 with the first ER environment 290, as illustrated in FIG. 2V. As contrasted with the home ER environment 206 illustrated in FIG. 2U, the first ER environment 290 corresponds to a purely VR environment and thus does not include the physical table 220.

The present disclosure describes various features, no single one of which is solely responsible for the benefits described herein. It will be understood that various features described herein may be combined, modified, or omitted, as would be apparent to one of ordinary skill. Other combinations and sub-combinations than those specifically described herein will be apparent to one of ordinary skill, and are intended to form a part of this disclosure. Various methods are described herein in connection with various flowchart steps and/or phases. It will be understood that in many cases, certain steps and/or phases may be combined together such that multiple steps and/or phases shown in the flowcharts can be performed as a single step and/or phase. Also, certain steps and/or phases can be broken into additional sub-components to be performed separately. In some instances, the order of the steps and/or phases can be rearranged and certain steps and/or phases may be omitted entirely. Also, the methods described herein are to be understood to be open-ended, such that additional steps and/or phases to those shown and described herein can also be performed.

Some or all of the methods and tasks described herein may be performed and fully automated by a computer system. The computer system may, in some cases, include multiple distinct computers or computing devices (e.g., physical servers, workstations, storage arrays, etc.) that communicate and interoperate over a network to perform the described functions. Each such computing device typically includes a processor (or multiple processors) that executes program instructions or modules stored in a memory or other non-transitory computer-readable storage medium or device. The various functions disclosed herein may be implemented in such program instructions, although some or all of the disclosed functions may alternatively be implemented in application-specific circuitry (e.g., ASICs or FPGAs or GP-GPUs) of the computer system. Where the computer system includes multiple computing devices, these devices may be co-located or not co-located. The results of the disclosed methods and tasks may be persistently stored by transforming physical storage devices, such as solid-state memory chips and/or magnetic disks, into a different state.

Various processes defined herein consider the option of obtaining and utilizing a user's personal information. For example, such personal information may be utilized in order to provide an improved privacy screen on an electronic device. However, to the extent such personal information is collected, such information should be obtained with the user's informed consent. As described herein, the user should have knowledge of and control over the use of their personal information.

Personal information will be utilized by appropriate parties only for legitimate and reasonable purposes. Those parties utilizing such information will adhere to privacy policies and practices that are at least in accordance with appropriate laws and regulations. In addition, such policies are to be well-established, user-accessible, and recognized as in compliance with or above governmental/industry standards. Moreover, these parties will not distribute, sell, or otherwise share such information outside of any reasonable and legitimate purposes.

Users may, however, limit the degree to which such parties may access or otherwise obtain personal information. For instance, settings or other preferences may be adjusted such that users can decide whether their personal information can be accessed by various entities. Furthermore, while some features defined herein are described in the context of using personal information, various aspects of these features can be implemented without the need to use such information. As an example, if user preferences, account names, and/or location history are gathered, this information can be obscured or otherwise generalized such that the information does not identify the respective user.

The disclosure is not intended to be limited to the implementations shown herein. Various modifications to the implementations described in this disclosure may be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other implementations without departing from the spirit or scope of this disclosure. The teachings of the invention provided herein can be applied to other methods and systems, and are not limited to the methods and systems described above, and elements and acts of the various implementations described above can be combined to provide further implementations. Accordingly, the novel methods and systems described herein may be implemented in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made without departing from the spirit of the disclosure. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the disclosure.

What is claimed is:
 1. A method comprising: at an electronic device including one or more processors, a non-transitory memory, one or more input devices, and a display device: displaying, via the display device, a home enhanced reality (ER) environment characterized by home ER world coordinates, including a first diorama-view representation of a first ER environment, wherein the first diorama-view representation includes one or more of ER objects arranged in a spatial relationship according to first ER world coordinates; detecting, via the one or more input devices, a first input that is directed to the first diorama-view representation; and in response to detecting the first input, transforming the home ER environment by: ceasing to display the first diorama-view representation within the home ER environment, transforming the spatial relationship between a subset of the one or more ER objects as a function of the home ER world coordinates and the first ER world coordinates, and displaying, via the display device, the subset of the one or more ER objects within the home ER environment based on the transformation.
 2. The method of claim 1, further comprising: while displaying the subset of the one or more ER objects within the home ER environment based on the transformation, detecting, via the one or more input devices, a second input; and in response to detecting the second input, adding the subset of the one or more ER objects to the home ER environment.
 3. The method of claim 1, further comprising: while displaying the subset of the one or more ER objects within the home ER environment based on the transformation, detecting, via the one or more input devices, a second input; and in response to detecting the second input, replacing the home ER environment with the first ER environment that includes the subset of the one or more ER objects.
 4. The method of claim 1, wherein, in response to detecting the first input, displaying the subset of the one or more ER objects from a first viewing vector, the method further comprising: while displaying the subset of the one or more ER objects within the home ER environment based on the transformation, detecting, via the one or more input devices, a second input; and in response to detecting the second input, changing display of the subset of the one or more ER objects from the first viewing vector to a second viewing vector while maintaining the subset of the one or more ER objects arranged according to the first ER world coordinates.
 5. The method of claim 4, wherein the second input is directed to the subset of the one or more ER objects.
 6. The method of claim 5, wherein the one or more input devices includes a hand tracking sensor, the method further comprising: detecting the second input via the hand tracking sensor; obtaining hand tracking data from the hand tracking sensor based on the second input; and determining, from the hand tracking data, that the second input is directed to the subset of the one or more ER objects.
 7. The method of claim 5, wherein the one or more input devices includes an eye tracking sensor, the method further comprising: detecting the second input via the eye tracking sensor; obtaining eye tracking data from the eye tracking sensor based on the second input; and determining, from the eye tracking data, that the second input is directed to the subset of the one or more ER objects.
 8. The method of claim 4, wherein the home ER environment includes one or more physical objects that are associated with the home ER world coordinates, and wherein changing display of the subset of the one or more ER objects from the first viewing vector to the second viewing vector includes moving the subset of the one or more ER objects relative to the one or more physical objects.
 9. The method of claim 4, wherein the second input corresponds to a change in position of the electronic device from a first pose to a second pose relative to the subset of the one or more ER objects, and wherein changing display of the subset of the one or more ER objects from the first viewing vector to the second viewing vector is based on the change in position of the electronic device from the first pose to the second pose.
 10. The method of claim 1, further comprising: obtaining, via an image sensor, environmental data bounded by a field-of-view associated with the image sensor, wherein the environmental data is associated with a physical environment including one or more physical objects; identifying, within the environmental data, a particular one of the one or more physical objects located within a spatial proximity threshold of the subset of the one or more ER objects; and moving the subset of the one or more ER objects relative to the one or more physical objects based on the particular one of the one or more physical objects.
 11. The method of claim 1, further comprising: displaying, via the display device, a plurality of diorama-view representations of a corresponding plurality of ER environments within the home ER environment, wherein the plurality of diorama-view representations includes the first diorama-view representation; and in response to detecting the first input, selecting the first diorama-view representation from the plurality of diorama-view representations.
 12. The method of claim 1, further comprising: displaying, via the display device, a plurality of diorama-view representations of a corresponding plurality of ER environments within the home ER environment, wherein the plurality of diorama-view representations includes the first diorama-view representation; and detecting, via the one or more input devices, a selection input that selects the first diorama-view representation from the plurality of diorama-view representations; and in response to detecting the selection input, maintaining display of the first diorama-view representation within the home ER environment and ceasing to display the remainder of the plurality of diorama-view representations.
 13. The method of claim 12, further comprising obtaining, via an eye tracking sensor, eye gaze data indicative of an eye gaze location, wherein the selection input is based on the eye gaze location.
 14. The method of claim 1, wherein the first ER environment is associated with an ER session that enables respective graphical representations of individuals to be concurrently within the first ER environment.
 15. The method of claim 14, wherein the first ER environment includes one or more ER representations respectively associated with one or more individuals that are connected to the ER session.
 16. The method of claim 15, wherein each of the one or more individuals has a respective access level that satisfies an access level criterion that is associated with the ER session.
 17. The method of claim 15, further comprising: while displaying the first diorama-view representation: playing, via a speaker of the electronic device, a first set of speech data that is associated with the one or more individuals that are connected to the ER session; obtaining, via an audio sensor of the electronic device, a second set of speech data from a user that is associated with the electronic device; and providing the second set of speech data to the one or more individuals that are connected to the ER session.
 18. The method of claim 1, further comprising: obtaining a characterization vector that provides a spatial characterization of the first ER environment, wherein the characterization vector includes a plurality of object label values that respectively identify the one or more ER objects, and wherein the characterization vector also includes a plurality of relative position values providing respective positions of the one or more ER objects relative to each other; and generating, from the first ER environment and the characterization vector, the first diorama-view representation.
 19. An electronic device comprising: one or more processors; a non-transitory memory; one or more input devices; a display device; and one or more programs, wherein the one or more programs are stored in the non-transitory memory and configured to be executed by the one or more processors, the one or more programs including instructions for: displaying, via the display device, a home enhanced reality (ER) environment characterized by home ER world coordinates, including a first diorama-view representation of a first ER environment, wherein the first diorama-view representation includes one or more of ER objects arranged in a spatial relationship according to first ER world coordinates; detecting, via the one or more input devices, a first input that is directed to the first diorama-view representation; and in response to detecting the first input, transforming the home ER environment by: ceasing to display the first diorama-view representation within the home ER environment, transforming the spatial relationship between a subset of the one or more ER objects as a function of the home ER world coordinates and the first ER world coordinates, and displaying, via the display device, the subset of the one or more ER objects within the home ER environment based on the transformation.
 20. A non-transitory computer readable storage medium storing one or more programs, the one or more programs comprising instructions, which, when executed by an electronic device including one or more input devices and a display device, cause the electronic device to: display, via the display device, a home enhanced reality (ER) environment characterized by home ER world coordinates, including a first diorama-view representation of a first ER environment, wherein the first diorama-view representation includes one or more of ER objects arranged in a spatial relationship according to first ER world coordinates; detect, via the one or more input devices, a first input that is directed to the first diorama-view representation; and in response to detecting the first input, transform the home ER environment by: ceasing to display the first diorama-view representation within the home ER environment, transforming the spatial relationship between a subset of the one or more ER objects as a function of the home ER world coordinates and the first ER world coordinates, and displaying, via the display device, the subset of the one or more ER objects within the home ER environment based on the transformation.